Example-Based Grapheme-to-Phoneme Conversion for Thai
Abstract
Several characteristics of the Thai writing system make Thai grapheme-to-phoneme (G2P) conversion very challenging. In this paper, we propose an Example-Based Grapheme-to-Phoneme conversion approach. It generates the pronunciation of a word by selecting, modifying, and combining the pronunciations of syllables from the training corpus. The best system achieves 80.99% word accuracy and 94.19% phone accuracy, significantly outperforming previous approaches for Thai.
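The abstract reports word accuracy and phone accuracy but does not define them. The sketch below assumes the common definitions: word accuracy is the fraction of words whose predicted phone sequence matches the reference exactly, and phone accuracy is one minus the total edit distance divided by the total number of reference phones. The function names and the toy phone symbols are illustrative assumptions, not taken from the paper.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance (substitutions, insertions, deletions) between phone sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(refs: List[Sequence[str]], hyps: List[Sequence[str]]) -> float:
    """Fraction of words whose whole pronunciation is predicted exactly (assumed definition)."""
    correct = sum(1 for r, h in zip(refs, hyps) if list(r) == list(h))
    return correct / len(refs)


def phone_accuracy(refs: List[Sequence[str]], hyps: List[Sequence[str]]) -> float:
    """1 - (total phone edit errors / total reference phones) (assumed definition)."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return 1.0 - errors / total


# Toy data with made-up phone symbols, for illustration only.
refs = [["k", "a"], ["m", "a", "j"]]
hyps = [["k", "a"], ["m", "a", "n"]]
print(word_accuracy(refs, hyps), phone_accuracy(refs, hyps))
```

Under these assumed definitions, a single wrong phone makes the whole word count as an error, which is why word accuracy is typically well below phone accuracy.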
Similar resources
Improving recognition of proper nouns in ASR through generating and filtering phonetic transcriptions
Accurate phonetic transcription of proper nouns can be an important resource for commercial applications that embed speech technologies, such as audio indexing and vocal phone directory lookup. However, an accurate phonetic transcription is more difficult to obtain for proper nouns than for regular words. Indeed, phonetic transcription of a proper noun depends on both the origin of the speaker pro...
Example-based grapheme-to-phoneme conversion for Thai
Several characteristics of the Thai writing system make Thai grapheme-to-phoneme (G2P) conversion very challenging. In this paper, we propose an Example-Based Grapheme-to-Phoneme conversion approach. It generates the pronunciation of a word by selecting, modifying and combining pronunciations from syllables from training corpus. The best system achieves 80.99% word accuracy and 94.19% phone accu...
Automatic generation of phonological variations
A recognition system must include structures which account for the different aspects of possible phonological variability in the incoming speech signal. To aid in dealing with variability at the phonological level, a grapheme-to-several-phoneme-strings module, VARIONO, has been developed at LIMSI. Tests of this system showed the necessity of creating a hierarchical structure to order phonological ...
Integrating Thai grapheme based acoustic models into the ML-MIX framework - for language independent and cross-language ASR
Grapheme based speech recognition is a powerful tool for rapidly creating automatic speech recognition (ASR) systems in new languages. For purposes of language independent or cross language speech recognition it is necessary to identify similar models in the different languages involved. For phoneme based multilingual ASR systems this is usually achieved with the help of a language independent ...
Phon: Free Software for Phonological Transcription and Analysis
1. OVERVIEW. Phon is an open-source program for the transcription and analysis of phonological and phonetic data. It was designed to help systematize research in children’s phonological development, but many functions in Phon, particularly the powerful search function, can be used for a wide range of investigations in phonetics and phonology. Phon is compatible with other language processing pr...